OcrV1, Main, Exploration, bibRecord, 000013

Improvement of a text detection chain and the proposition of a new evaluation protocol for text detection algorithms

Internal identifier: 000013 (Main/Exploration); previous: 000012; next: 000014

Author: Stefania Calarasanu [France]

RBID : Hal:tel-01318351

French descriptors

mix :
- Comparaison d'algorithmes, Détection de texte, Earth mover's distance, Métriques de performance, Protocole d'évaluation, Rectification de texte, Texte en perspective, Visualisation par histogrammes.

English descriptors

mix :
- Evaluation protocol, Performance metrics, Text detection.

Abstract

The growing number of text detection approaches proposed in the literature requires a rigorous performance evaluation and ranking. An evaluation protocol relies on three elements: a reliable text reference, a matching strategy and finally a set of metrics. The few existing evaluation protocols often lack accuracy either due to inconsistent matching or due to unrepresentative metrics. In this thesis we propose a new evaluation protocol that tackles most of the drawbacks faced by currently used evaluation methods. This work is focused on three main contributions: firstly, we introduce a complex text reference representation that does not constrain text detectors to adopt a specific detection granularity level or annotation representation; secondly, we propose a set of matching rules capable of evaluating any type of scenario that can occur between a text reference and a detection; and finally we show how we can analyze a set of detection results, not only through a set of metrics, but also through an intuitive visual representation. A frequent challenge for many Text Understanding Systems is to tackle the variety of text characteristics in born-digital and natural scene images for which current OCRs are not well adapted. For example, texts in perspective are frequently present in real-world images because the camera capture angle is not normal to the plane containing the text regions. Despite the ability of some detectors to accurately localize such text objects, the recognition stage fails most of the time. In this thesis we also propose a rectification procedure capable of correcting highly distorted texts evaluated on a very challenging dataset.
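The three elements named in the abstract (a text reference, a matching strategy and a set of metrics) can be illustrated with a minimal sketch. This is a generic one-to-one baseline based on intersection over union (IoU), not the protocol proposed in the thesis; the names `Box`, `iou`, `evaluate` and the 0.5 threshold are assumptions chosen for illustration.

```python
from typing import List, Tuple

# Assumed representation: axis-aligned box (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def evaluate(references: List[Box], detections: List[Box],
             threshold: float = 0.5) -> Tuple[float, float]:
    """Greedy one-to-one matching, then precision and recall.

    Each detection is matched to the unmatched reference with the
    highest IoU, provided that IoU reaches the (assumed) threshold.
    """
    matched_refs = set()
    true_positives = 0
    for det in detections:
        best_idx, best_score = None, threshold
        for idx, ref in enumerate(references):
            if idx in matched_refs:
                continue
            score = iou(det, ref)
            if score >= best_score:
                best_idx, best_score = idx, score
        if best_idx is not None:
            matched_refs.add(best_idx)
            true_positives += 1
    precision = true_positives / len(detections) if detections else 0.0
    recall = true_positives / len(references) if references else 0.0
    return precision, recall


if __name__ == "__main__":
    refs = [(0, 0, 10, 10), (20, 20, 30, 30)]
    dets = [(0, 0, 10, 10), (50, 50, 60, 60)]
    print(evaluate(refs, dets))
```

A one-to-one rule of this kind cannot score a single detection that covers several reference boxes, or several detections that split one reference. Those are among the scenarios the abstract says the proposed matching rules are designed to handle.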

URL:

https://tel.archives-ouvertes.fr/tel-01318351

Affiliations:

France

Links toward previous steps (curation, corpus...)

to stream Hal, to step Corpus: 000064
to stream Hal, to step Curation: 000064
to stream Hal, to step Checkpoint: 000001
to stream Main, to step Merge: 000013
to stream Main, to step Curation: 000013

The document in XML format

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Improvement of a text detection chain and the proposition of a new evaluation protocol for text detection algorithms</title>
<title xml:lang="fr">Amélioration d'une chaîne de détection de texte et proposition d'un nouveau protocole d'évaluation d'algorithmes de détection de texte</title>
<author><name sortKey="Calarasanu, Stefania" sort="Calarasanu, Stefania" uniqKey="Calarasanu S" first="Stefania" last="Calarasanu">Stefania Calarasanu</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-10947" status="VALID"><orgName>Laboratoire de Recherche et de Développement de l'EPITA</orgName>
<orgName type="acronym">LRDE</orgName>
<desc><address><addrLine>LRDE, EPITA 14-16, rue Voltaire F-94276 Le Kremlin Bicêtre cedex France</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.lrde.epita.fr</ref>
</desc>
<listRelation><relation active="#struct-305456" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-305456" type="direct"><org type="institution" xml:id="struct-305456" status="INCOMING"><orgName>Ecole Pour l'Informatique et les Techniques Avancées</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:tel-01318351</idno>
<idno type="halId">tel-01318351</idno>
<idno type="halUri">https://tel.archives-ouvertes.fr/tel-01318351</idno>
<idno type="url">https://tel.archives-ouvertes.fr/tel-01318351</idno>
<date when="2015-12-11">2015-12-11</date>
<idno type="wicri:Area/Hal/Corpus">000064</idno>
<idno type="wicri:Area/Hal/Curation">000064</idno>
<idno type="wicri:Area/Hal/Checkpoint">000001</idno>
<idno type="wicri:Area/Main/Merge">000013</idno>
<idno type="wicri:Area/Main/Curation">000013</idno>
<idno type="wicri:Area/Main/Exploration">000013</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Improvement of a text detection chain and the proposition of a new evaluation protocol for text detection algorithms</title>
<title xml:lang="fr">Amélioration d'une chaîne de détection de texte et proposition d'un nouveau protocole d'évaluation d'algorithmes de détection de texte</title>
<author><name sortKey="Calarasanu, Stefania" sort="Calarasanu, Stefania" uniqKey="Calarasanu S" first="Stefania" last="Calarasanu">Stefania Calarasanu</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-10947" status="VALID"><orgName>Laboratoire de Recherche et de Développement de l'EPITA</orgName>
<orgName type="acronym">LRDE</orgName>
<desc><address><addrLine>LRDE, EPITA 14-16, rue Voltaire F-94276 Le Kremlin Bicêtre cedex France</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.lrde.epita.fr</ref>
</desc>
<listRelation><relation active="#struct-305456" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-305456" type="direct"><org type="institution" xml:id="struct-305456" status="INCOMING"><orgName>Ecole Pour l'Informatique et les Techniques Avancées</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="mix" xml:lang="en"><term>Evaluation protocol</term>
<term>Performance metrics</term>
<term>Text detection</term>
</keywords>
<keywords scheme="mix" xml:lang="fr"><term>Comparaison d'algorithmes</term>
<term>Détection de texte</term>
<term>Earth mover's distance</term>
<term>Métriques de performance</term>
<term>Protocole d'évaluation</term>
<term>Rectification de texte</term>
<term>Texte en perspective</term>
<term>Visualisation par histogrammes</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">The growing number of text detection approaches proposed in the literature requires a rigorous performance evaluation and ranking. An evaluation protocol relies on three elements: a reliable text reference, a matching strategy and finally a set of metrics. The few existing evaluation protocols often lack accuracy either due to inconsistent matching or due to unrepresentative metrics. In this thesis we propose a new evaluation protocol that tackles most of the drawbacks faced by currently used evaluation methods. This work is focused on three main contributions: firstly, we introduce a complex text reference representation that does not constrain text detectors to adopt a specific detection granularity level or annotation representation; secondly, we propose a set of matching rules capable of evaluating any type of scenario that can occur between a text reference and a detection; and finally we show how we can analyze a set of detection results, not only through a set of metrics, but also through an intuitive visual representation. A frequent challenge for many Text Understanding Systems is to tackle the variety of text characteristics in born-digital and natural scene images for which current OCRs are not well adapted. For example, texts in perspective are frequently present in real-world images because the camera capture angle is not normal to the plane containing the text regions. Despite the ability of some detectors to accurately localize such text objects, the recognition stage fails most of the time. In this thesis we also propose a rectification procedure capable of correcting highly distorted texts evaluated on a very challenging dataset.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
</country>
</list>
<tree><country name="France"><noRegion><name sortKey="Calarasanu, Stefania" sort="Calarasanu, Stefania" uniqKey="Calarasanu S" first="Stefania" last="Calarasanu">Stefania Calarasanu</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
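
Parts of the record can also be read with a general-purpose XML parser. The sketch below uses Python's standard `xml.etree.ElementTree` on a copy of the English `keywords` element only. As displayed here, the full record does not declare the `wicri:` and `hal:` prefixes, so a namespace-aware parser is expected to reject it as-is. The names `FRAGMENT` and `keywords_by_language` are assumptions introduced for illustration.

```python
import xml.etree.ElementTree as ET

# Copy of the English keywords element from the record above,
# wrapped in its parent <textClass> element.
FRAGMENT = """<textClass><keywords scheme="mix" xml:lang="en"><term>Evaluation protocol</term>
<term>Performance metrics</term>
<term>Text detection</term>
</keywords>
</textClass>"""

# ElementTree exposes xml:lang under the predefined XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def keywords_by_language(xml_text: str) -> dict:
    """Group the <term> values of each <keywords> element by xml:lang."""
    root = ET.fromstring(xml_text)
    result = {}
    for keywords in root.iter("keywords"):
        lang = keywords.get(XML_LANG, "und")  # "und" = undetermined
        terms = [t.text.strip() for t in keywords.iter("term") if t.text]
        result.setdefault(lang, []).extend(terms)
    return result


if __name__ == "__main__":
    print(keywords_by_language(FRAGMENT))
```

The same function should work on the French `keywords` element, since it also uses only the predefined `xml:` prefix.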

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000013 | SxmlIndent | more

Or, if $EXPLOR_AREA is set to the root of the OcrV1 area:

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000013 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:tel-01318351
   |texte=   Improvement of a text detection chain and the proposition of a new evaluation protocol for text detection algorithms
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024

OCR exploration server
Warning: this site is under development. It was generated automatically from raw corpora, so the information has not been validated.